Feature Selection based Semi-Supervised Subspace Clustering

نویسندگان

  • V. R. Saraswathy
  • M. Revathi
چکیده

Clustering is the process which is used to assign a set of n objects into clusters(groups). Dimensionality reduction techniques help in increasing the accuracy of clustering results by removing redundant and irrelevant dimensions. But, in most of the situations, objects can be related in different ways in different subsets of the dimensions. Dimensionality reduction tends to get rid of such relationship information and generate clusters which do not fully reflect the real cluster’s properties. Subspace clustering preserves such relationships by detecting all clusters in all subspaces. The accuracy of the subspace clustering results can be improved by making use of semi-supervised learning method. But finding subspaces by considering all input dimensions may decrease the clustering accuracy. This paper proposes a feature selection based semi-supervised subspace clustering method which applies feature selection in the beginning to eliminate unnecessary dimensions. Later, subspace clustering can be performed on the resulting dataset. This approach tends to improve the accuracy of resulting clusters since subspace clustering is performed on a reduced dataset. Experimental results show that the proposed method produces high quality clusters than semi-supervised subspace clustering algorithm. General Terms Dimensionality Reduction, Clustering.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Wised Semi-Supervised Cluster Ensemble Selection: A New Framework for Selecting and Combing Multiple Partitions Based on Prior knowledge

The Wisdom of Crowds, an innovative theory described in social science, claims that the aggregate decisions made by a group will often be better than those of its individual members if the four fundamental criteria of this theory are satisfied. This theory used for in clustering problems. Previous researches showed that this theory can significantly increase the stability and performance of...

متن کامل

Wised Semi-Supervised Cluster Ensemble Selection: A New Framework for Selecting and Combing Multiple Partitions Based on Prior knowledge

The Wisdom of Crowds, an innovative theory described in social science, claims that the aggregate decisions made by a group will often be better than those of its individual members if the four fundamental criteria of this theory are satisfied. This theory used for in clustering problems. Previous researches showed that this theory can significantly increase the stability and performance of...

متن کامل

A Subspace Clustering Model for Image Texture Segmentation

We propose a novel image segmentation model, called the Semi-Supervised Subspace Mumford-Shah model, which incorporates subspace clustering techniques into a Mumford-Shah model to solve texture segmentation problems. While the natural unsupervised approach to learn a feature subspace can easily be trapped in a local solution, we propose a novel semi-supervised optimization algorithm that makes ...

متن کامل

Subspace Scores for Feature Selection in Computer Vision

Feature selection has become an essential tool in machine learning – by distilling data vectors to a small set of informative dimensions, it is possible to significantly accelerate learning algorithms and avoid overfitting. Feature selection is especially important in computer vision, where large image vectors are often combined with huge synthetically generated feature sets. Inspired by recent...

متن کامل

Semi-supervised Clustering of Graph Objects: A Subgraph Mining Approach

Semi-supervised clustering has recently received a lot of attention in the literature, which aims to improve the clustering performance with limited supervision. Most existing semi-supervised clustering studies assume that the data is represented in a vector space, e.g., text and relational data. When the data objects have complex structures, e.g., proteins and chemical compounds, those semi-su...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013